Surface Realisation from Knowledge-Bases
نویسندگان
چکیده
We present a simple, data-driven approach to generation from knowledge bases (KB). A key feature of this approach is that grammar induction is driven by the extended domain of locality principle of TAG (Tree Adjoining Grammar); and that it takes into account both syntactic and semantic information. The resulting extracted TAG includes a unification based semantics and can be used by an existing surface realiser to generate sentences from KB data. Experimental evaluation on the KBGen data shows that our model outperforms a data-driven generate-and-rank approach based on an automatically induced probabilistic grammar; and is comparable with a handcrafted symbolic approach.
منابع مشابه
Statistical Surface Realisation of Portuguese Referring Expressions
Natural Language Generation systems usually require substantial knowledge about the structure of the target language in order to perform the final task in the generation process – the mapping from semantic representation to text known as surface realisation. Designing knowledge bases of this kind, typically represented as sets of grammar rules, may however become a costly, labour-intensive ente...
متن کاملCreating Training Corpora for NLG Micro-Planning
In this paper, we present a novel framework for semi-automatically creating linguistically challenging microplanning data-to-text corpora from existing Knowledge Bases. Because our method pairs data of varying size and shape with texts ranging from simple clauses to short texts, a dataset created using this framework provides a challenging benchmark for microplanning. Another feature of this fr...
متن کاملExtracting Surface Realisation Templates from Corpora
In Natural Language Generation (NLG), template-based surface realisation is an effective solution to the problem of producing surface strings from a given semantic representation, but many applications may not be able to provide the input knowledge in the required level of detail, which in turn may limit the use of the available NLG resources. However, if we know in advance what the most likely...
متن کاملText-to-Text Surface Realisation Using Dependency-Tree Replacement
Surface realisation the task of producing word strings from non-linguistic input data has been the focus of a great deal of research in the field of data-to-text Natural Language Generation (NLG). In this work we discuss an alternative approach to surface realisation, in which we borrow NLG techniques from the sister field of text-to-text generation to implement text generation based on example...
متن کاملText Generation in a Dynamic Hypertext Environment
This paper describes PEBA-II, a working natural language generation system which interactively describes animals in a taxonomic knowledge base via the production of World Wide Web pages. Our aim is to construct a natural language document generation system with real practical applicability: to this end, the system reconstructs and combines a number of existing ideas in the literature in a novel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014